Improving Fine-Grained Visual Classification using Pairwise Confusion
Authors
Abstract
Fine-Grained Visual Classification (FGVC) datasets contain small sample sizes, along with significant intra-class variation and inter-class similarity. While prior work has addressed intra-class variation using localization and segmentation techniques, inter-class similarity may also affect feature learning and reduce classification performance. In this work, we address this problem using a novel optimization procedure for end-to-end neural network training on FGVC tasks. This procedure, called Pairwise Confusion (PC), regularizes training by intentionally introducing confusion in the activations, with the aim of learning features that generalize better and thereby reducing overfitting. With PC regularization, we obtain state-of-the-art performance on six of the most widely used FGVC datasets and demonstrate improved localization ability. PC is easy to implement, does not need excessive hyperparameter tuning during training, and does not add significant overhead at test time.
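The abstract does not give the form of the confusion term, so the following is only a minimal sketch of how such a regularizer might look, not the authors' implementation. It assumes the batch is split into two halves, that sample `i` is paired with sample `i + half`, and that pairs with different labels are penalized by the squared Euclidean distance between their predicted class distributions. The function name `pairwise_confusion_loss` and the weight `lam` are hypothetical names chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pairwise_confusion_loss(batch_logits, labels, lam=0.1):
    """Cross-entropy plus an assumed pairwise confusion penalty.

    Assumptions (not taken from the abstract): pairs are formed as
    (i, i + half), and the penalty is the squared Euclidean distance
    between the two predicted distributions when the labels differ.
    Returns (total, cross_entropy, confusion).
    """
    n = len(batch_logits)
    half = n // 2
    probs = [softmax(z) for z in batch_logits]

    # Standard mean cross-entropy over the whole batch.
    ce = sum(-math.log(p[y]) for p, y in zip(probs, labels)) / n

    # Confusion term: pull predictions of different-class pairs together.
    confusion = 0.0
    for i in range(half):
        j = i + half
        if labels[i] != labels[j]:
            confusion += sum((a - b) ** 2 for a, b in zip(probs[i], probs[j]))
    confusion /= max(half, 1)

    return ce + lam * confusion, ce, confusion


if __name__ == "__main__":
    logits = [[2.0, 0.0], [0.0, 2.0]]
    total, ce, conf = pairwise_confusion_loss(logits, [0, 1], lam=0.1)
    print(total, ce, conf)
```

Under these assumptions the extra term only touches the training loss, which is consistent with the abstract's statement that PC adds no significant overhead at test time: inference uses the same forward pass as an unregularized network.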
Similar resources
Visual-textual Attention Driven Fine-grained Representation Learning
Fine-grained image classification is to recognize hundreds of subcategories belonging to the same basic-level category, which is a highly challenging task due to the quite subtle visual distinctions among similar subcategories. Most existing methods generally learn part detectors to discover discriminative regions for better classification accuracy. However, not all localized parts are benefici...
Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained Visual Categorization
Fine-grained visual categorization (FGVC) has drawn increasing attention as an emerging research field in recent years. In contrast to generic-domain visual recognition, FGVC is characterized by high intraclass and subtle inter-class variations. To distinguish conceptually and visually similar categories, highly discriminative visual features must be extracted. Moreover, FGVC has highly special...
TERMINOLOGY AND THE CLASSIFICATION OF FINE GRAINED SEDIMENTARY ROCKS – is there a difference between a claystone, a mudstone and a shale?
Fine grained sedimentary rocks, both clastic and carbonate, are believed to be the most abundant rock type on the Earth's surface (Picard, 1971; Blatt, 1982). Fine grained rocks appear to constitute somewhere in the region of 70% (Holmes, 1937) and 80% (Clarke, 1924) of all the sediment ever produced. In sedimentology the size grade scale most commonly used is that which was introduced by Udden...
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Human actions often involve complex interactions across several inter-related objects in the scene. However, existing approaches to fine-grained video understanding or visual relationship detection often rely on single object representation or pairwise object relationships. Furthermore, learning interactions across multiple objects in hundreds of frames for video is computationally infeasible a...
Fine-grained Recognition Datasets for Biodiversity Analysis
In the following paper, we present and discuss challenging applications for fine-grained visual classification (FGVC): biodiversity and species analysis. We not only give details about two challenging new datasets suitable for computer vision research with up to 675 highly similar classes, but also present first results with localized features using convolutional neural networks (CNN). We concl...